知识图表(kg)作为从大型自然语言文本语料库中举行蒸馏信息的伟大工具。查询知识图表的自然语言问题对于这些信息的人类消费至关重要。通常通过将自然语言查询转换为结构化查询,然后在kg上触发结构化查询来解决此问题。在文献中的知识图中直接回答模型很少。查询转换模型和直接模型都需要与知识图表的域有关的特定培训数据。在这项工作中,我们将通过知识图表的自然语言问题转换为前提假设对的推理问题。使用培训的深度学习模型进行转换后的代理推理问题,我们为原始自然语言查询问题提供了解决方案。我们的方法在MetaQA数据集中实现了超过90%的准确性,击败现有的最先进。我们还提出了一种推论称为分层复发路径编码器(HRPE)的模型。可以微调推断模型以跨越跨越培训数据的域使用。我们的方法不需要大型域特定的培训数据来查询来自不同域的新知识图表。
translated by 谷歌翻译
Prior work has shown that coupling sequential latent variable models with semantic ontological knowledge can improve the representational capabilities of event modeling approaches. In this work, we present a novel, doubly hierarchical, semi-supervised event modeling framework that provides structural hierarchy while also accounting for ontological hierarchy. Our approach consists of multiple layers of structured latent variables, where each successive layer compresses and abstracts the previous layers. We guide this compression through the injection of structured ontological knowledge that is defined at the type level of events: importantly, our model allows for partial injection of semantic knowledge and it does not depend on observing instances at any particular level of the semantic ontology. Across two different datasets and four different evaluation metrics, we demonstrate that our approach is able to out-perform the previous state-of-the-art approaches, demonstrating the benefits of structured and semantic hierarchical knowledge for event modeling.
translated by 谷歌翻译
Extensible objects form a challenging case for NRSfM, owing to the lack of a sufficiently constrained extensible model of the point-cloud. We tackle the challenge by proposing 1) convex relaxations of the isometric model up to quasi-isometry, and 2) convex relaxations involving the equiareal deformation model, which preserves local area and has not been used in NRSfM. The equiareal model is appealing because it is physically plausible and widely applicable. However, it has two main difficulties: first, when used on its own, it is ambiguous, and second, it involves quartic, hence highly nonconvex, constraints. Our approach handles the first difficulty by mixing the equiareal with the isometric model and the second difficulty by new convex relaxations. We validate our methods on multiple real and synthetic data, including well-known benchmarks.
translated by 谷歌翻译
The relevance of machine learning (ML) in our daily lives is closely intertwined with its explainability. Explainability can allow end-users to have a transparent and humane reckoning of a ML scheme's capability and utility. It will also foster the user's confidence in the automated decisions of a system. Explaining the variables or features to explain a model's decision is a need of the present times. We could not really find any work, which explains the features on the basis of their class-distinguishing abilities (specially when the real world data are mostly of multi-class nature). In any given dataset, a feature is not equally good at making distinctions between the different possible categorizations (or classes) of the data points. In this work, we explain the features on the basis of their class or category-distinguishing capabilities. We particularly estimate the class-distinguishing capabilities (scores) of the variables for pair-wise class combinations. We validate the explainability given by our scheme empirically on several real-world, multi-class datasets. We further utilize the class-distinguishing scores in a latent feature context and propose a novel decision making protocol. Another novelty of this work lies with a \emph{refuse to render decision} option when the latent variable (of the test point) has a high class-distinguishing potential for the likely classes.
translated by 谷歌翻译
The use of multilingual language models for tasks in low and high-resource languages has been a success story in deep learning. In recent times, Arabic has been receiving widespread attention on account of its dialectal variance. While prior research studies have tried to adapt these multilingual models for dialectal variants of Arabic, it still remains a challenging problem owing to the lack of sufficient monolingual dialectal data and parallel translation data of such dialectal variants. It remains an open problem on whether the limited dialectical data can be used to improve the models trained in Arabic on its dialectal variants. First, we show that multilingual-BERT (mBERT) incrementally pretrained on Arabic monolingual data takes less training time and yields comparable accuracy when compared to our custom monolingual Arabic model and beat existing models (by an avg metric of +$6.41$). We then explore two continual pre-training methods-- (1) using small amounts of dialectical data for continual finetuning and (2) parallel Arabic to English data and a Translation Language Modeling loss function. We show that both approaches help improve performance on dialectal classification tasks ($+4.64$ avg. gain) when used on monolingual models.
translated by 谷歌翻译
关于文本到SQL语义解析的最新研究取决于解析器本身或基于简单的启发式方法来理解自然语言查询(NLQ)。合成SQL查询时,没有可用的NLQ的明确语义信息,从而导致不良的概括性能。此外,如果没有词汇级的细粒度查询理解,查询与数据库之间的链接只能依赖模糊的字符串匹配,这会导致实际应用中的次优性能。考虑到这一点,在本文中,我们提出了一个基于令牌级的细粒度查询理解的通用,模块化的神经语义解析框架。我们的框架由三个模块组成:命名实体识别器(NER),神经实体接头(NEL)和神经语义解析器(NSP)。通过共同建模查询和数据库,NER模型可以分析用户意图并确定查询中的实体。 NEL模型将类型的实体链接到数据库中的模式和单元格值。解析器模型利用可用的语义信息并链接结果并根据动态生成的语法合成树结构的SQL查询。新发布的语义解析数据集的Squall实验表明,我们可以在WikiableQuestions(WTQ)测试集上实现56.8%的执行精度,这使最先进的模型的表现优于2.7%。
translated by 谷歌翻译
尽管基于尖峰神经网络(SNN)的神经形态计算体系结构越来越引起人们的兴趣,作为通往生物学上的机器学习的途径,但注意力仍然集中在神经元和突触等计算单元上。本文从这种神经突触的角度转移,试图探索神经胶质细胞的自我修复作用,尤其是星形胶质细胞。这项工作研究了与星形胶质细胞计算神经科学模型的更强相关性,以开发具有更高程度的生物效率的宏模型,从而准确地捕获了自我修复过程的动态行为。硬件软件共同设计分析表明,生物形态的星形胶质细胞调节有可能在神经形态硬件系统中自我修复硬件现实的故障,具有更好的精度和修复收敛,以实现MNIST和F-MNIST数据集的无监督学习任务。
translated by 谷歌翻译
张量网络是一种用于表达和近似大量数据的分解类型。给定的数据集,量子状态或更高维的多线性图是由较小的多线性图组成的组成和近似的。这让人联想到如何将布尔函数分解为栅极阵列:这代表了张量分解的特殊情况,其中张量输入的条目被0、1替换,并且分解化精确。相关技术的收集称为张量网络方法:该主题在几个不同的研究领域中独立开发,这些领域最近通过张量网络的语言变得相互关联。该领域中的Tantamount问题涉及张量网络的可表达性和减少计算开销。张量网络与机器学习的合并是自然的。一方面,机器学习可以帮助确定近似数据集的张量网络的分解。另一方面,可以将给定的张量网络结构视为机器学习模型。本文中,调整了张量网络参数以学习或分类数据集。在这项调查中,我们恢复了张量网络的基础知识,并解释了开发机器学习中张量网络理论的持续努力。
translated by 谷歌翻译
随着野火产生的大气气溶胶减少了向地球的传入太阳辐射,越来越频繁的野火会显着影响太阳能的产生。通过气溶胶光学深度(AOD)测量大气气溶胶,可以通过地球静止卫星检索和监测AOD数据流。但是,多源遥感数据流通常具有异质特征,包括不同的数据缺失率,测量误差,系统偏见等。为了准确估计和预测潜在的AOD传播过程,存在实践需求和理论利益,以提出一种通过同时利用或融合多种源的异质卫星远程远程远程灵感数据来建模物理信息的统计方法。提出的方法利用光谱方法将多源卫星数据流与控制AOD传播过程的基本对流扩散方程相结合。统计模型中包括一个偏差校正过程,以说明物理模型的偏差和傅立叶系列的截断误差。提出的方法适用于从国家海洋和大气管理局获得的加利福尼亚野火AOD数据流。提供了全面的数值示例,以证明所提出方法的预测能力和模型解释性。计算机代码已在GitHub上提供。
translated by 谷歌翻译
脑出血(ICH)是最致命的中风子类型,死亡率高达52%。由于颅骨切开术引起的潜在皮质破坏,保守管理(注意等待)历史上一直是一种常见的治疗方法。最小的侵入性疏散最近已成为一种可公认的治疗方法,用于体积30-50 mL的深座性血肿的患者,但适当的可视化和工具敏感性仍然受到常规内窥镜方法的限制,尤其是较大的血肿体积(> 50 mL)。在本文中,我们描述了Aspihre的发展(脑部出血机器人疏散的手术平台),这是有史以来的第一个同心管机器人,该机器人使用现成的塑料管来进行MR引导ICH撤离,改善工具敏感性和程序可视化。机器人运动学模型是基于基于校准的方法和试管力学建模开发的,使模型可以考虑可变曲率和扭转偏转。使用可变增益PID算法控制旋转精度为0.317 +/- 0.3度。硬件和理论模型在一系列系统的基准和MRI实验中进行了验证,导致1.39 +\ -0.54 mm的管尖的位置精度。验证靶向准确性后,在MR引导的幻影凝块疏散实验中测试了机器人的疏散功效。该机器人能够在5分钟内撤离最初38.36 mL的凝块,使残留血肿为8.14 mL,远低于15 mL指南,表明良好的后疏散临床结果。
translated by 谷歌翻译